Dataset statistics
| Number of variables | 9 |
|---|---|
| Number of observations | 1030 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 11 |
| Duplicate rows (%) | 1.1% |
| Total size in memory | 72.5 KiB |
| Average record size in memory | 72.1 B |
Variable types
| Numeric | 9 |
|---|
| Dataset has 11 (1.1%) duplicate rows | Duplicates |
water is highly overall correlated with superplasticizer | High correlation |
superplasticizer is highly overall correlated with water | High correlation |
age is highly overall correlated with concrete_compressive_strength | High correlation |
concrete_compressive_strength is highly overall correlated with age | High correlation |
blast_furnace_slag has 471 (45.7%) zeros | Zeros |
fly_ash has 566 (55.0%) zeros | Zeros |
superplasticizer has 379 (36.8%) zeros | Zeros |
Reproduction
| Analysis started | 2023-06-03 00:13:09.227007 |
|---|---|
| Analysis finished | 2023-06-03 00:13:32.072294 |
| Duration | 22.85 seconds |
| Software version | pandas-profiling v3.6.6 |
| Download configuration | config.json |
cement
Real number (ℝ)
| Distinct | 278 |
|---|---|
| Distinct (%) | 27.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 281.16786 |
| Minimum | 102 |
|---|---|
| Maximum | 540 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.2 KiB |
Quantile statistics
| Minimum | 102 |
|---|---|
| 5-th percentile | 143.745 |
| Q1 | 192.375 |
| median | 272.9 |
| Q3 | 350 |
| 95-th percentile | 480 |
| Maximum | 540 |
| Range | 438 |
| Interquartile range (IQR) | 157.625 |
Descriptive statistics
| Standard deviation | 104.50636 |
|---|---|
| Coefficient of variation (CV) | 0.37168673 |
| Kurtosis | -0.52065228 |
| Mean | 281.16786 |
| Median Absolute Deviation (MAD) | 79.4 |
| Skewness | 0.50948118 |
| Sum | 289602.9 |
| Variance | 10921.58 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 362.6 | 20 | 1.9% |
| 425 | 20 | 1.9% |
| 251.4 | 15 | 1.5% |
| 310 | 14 | 1.4% |
| 446 | 14 | 1.4% |
| 331 | 13 | 1.3% |
| 475 | 13 | 1.3% |
| 250 | 13 | 1.3% |
| 349 | 12 | 1.2% |
| 387 | 12 | 1.2% |
| Other values (268) | 884 |
| Value | Count | Frequency (%) |
| 102 | 4 | |
| 108.3 | 4 | |
| 116 | 4 | |
| 122.6 | 4 | |
| 132 | 2 | 0.2% |
| 133 | 5 | |
| 133.1 | 1 | 0.1% |
| 134.7 | 1 | 0.1% |
| 135 | 2 | 0.2% |
| 135.7 | 2 | 0.2% |
| Value | Count | Frequency (%) |
| 540 | 9 | |
| 531.3 | 5 | |
| 528 | 1 | 0.1% |
| 525 | 7 | |
| 522 | 2 | 0.2% |
| 520 | 2 | 0.2% |
| 516 | 2 | 0.2% |
| 505 | 1 | 0.1% |
| 500.1 | 1 | 0.1% |
| 500 | 10 |
blast_furnace_slag
Real number (ℝ)
| Distinct | 185 |
|---|---|
| Distinct (%) | 18.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 73.895825 |
| Minimum | 0 |
|---|---|
| Maximum | 359.4 |
| Zeros | 471 |
| Zeros (%) | 45.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 22 |
| Q3 | 142.95 |
| 95-th percentile | 236 |
| Maximum | 359.4 |
| Range | 359.4 |
| Interquartile range (IQR) | 142.95 |
Descriptive statistics
| Standard deviation | 86.279342 |
|---|---|
| Coefficient of variation (CV) | 1.1675807 |
| Kurtosis | -0.50817548 |
| Mean | 73.895825 |
| Median Absolute Deviation (MAD) | 22 |
| Skewness | 0.8007169 |
| Sum | 76112.7 |
| Variance | 7444.1248 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 471 | |
| 189 | 30 | 2.9% |
| 106.3 | 20 | 1.9% |
| 24 | 14 | 1.4% |
| 20 | 12 | 1.2% |
| 145 | 11 | 1.1% |
| 98.1 | 10 | 1.0% |
| 19 | 10 | 1.0% |
| 26 | 8 | 0.8% |
| 22 | 8 | 0.8% |
| Other values (175) | 436 |
| Value | Count | Frequency (%) |
| 0 | 471 | |
| 11 | 4 | 0.4% |
| 13.6 | 5 | 0.5% |
| 15 | 5 | 0.5% |
| 17.2 | 1 | 0.1% |
| 17.5 | 1 | 0.1% |
| 17.6 | 1 | 0.1% |
| 19 | 10 | 1.0% |
| 20 | 12 | 1.2% |
| 22 | 8 | 0.8% |
| Value | Count | Frequency (%) |
| 359.4 | 2 | 0.2% |
| 342.1 | 2 | 0.2% |
| 316.1 | 2 | 0.2% |
| 305.3 | 4 | |
| 290.2 | 2 | 0.2% |
| 288 | 4 | |
| 282.8 | 4 | |
| 272.8 | 2 | 0.2% |
| 262.2 | 5 | |
| 260 | 1 | 0.1% |
fly_ash
Real number (ℝ)
| Distinct | 156 |
|---|---|
| Distinct (%) | 15.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.18835 |
| Minimum | 0 |
|---|---|
| Maximum | 200.1 |
| Zeros | 566 |
| Zeros (%) | 55.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 118.3 |
| 95-th percentile | 167 |
| Maximum | 200.1 |
| Range | 200.1 |
| Interquartile range (IQR) | 118.3 |
Descriptive statistics
| Standard deviation | 63.997004 |
|---|---|
| Coefficient of variation (CV) | 1.1810104 |
| Kurtosis | -1.3287464 |
| Mean | 54.18835 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.53735391 |
| Sum | 55814 |
| Variance | 4095.6165 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 566 | |
| 118.3 | 20 | 1.9% |
| 141 | 16 | 1.6% |
| 24.5 | 15 | 1.5% |
| 79 | 14 | 1.4% |
| 94 | 13 | 1.3% |
| 100.4 | 11 | 1.1% |
| 125.2 | 10 | 1.0% |
| 95.7 | 10 | 1.0% |
| 98.8 | 10 | 1.0% |
| Other values (146) | 345 |
| Value | Count | Frequency (%) |
| 0 | 566 | |
| 24.5 | 15 | 1.5% |
| 59 | 1 | 0.1% |
| 60 | 1 | 0.1% |
| 71 | 1 | 0.1% |
| 71.5 | 1 | 0.1% |
| 75.6 | 1 | 0.1% |
| 76 | 1 | 0.1% |
| 77 | 2 | 0.2% |
| 78 | 2 | 0.2% |
| Value | Count | Frequency (%) |
| 200.1 | 1 | 0.1% |
| 200 | 1 | 0.1% |
| 195 | 3 | |
| 194.9 | 1 | 0.1% |
| 194 | 1 | 0.1% |
| 193 | 1 | 0.1% |
| 190 | 1 | 0.1% |
| 187 | 1 | 0.1% |
| 185.3 | 1 | 0.1% |
| 185 | 2 |
water
Real number (ℝ)
| Distinct | 195 |
|---|---|
| Distinct (%) | 18.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 181.56728 |
| Minimum | 121.8 |
|---|---|
| Maximum | 247 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.2 KiB |
Quantile statistics
| Minimum | 121.8 |
|---|---|
| 5-th percentile | 146.1 |
| Q1 | 164.9 |
| median | 185 |
| Q3 | 192 |
| 95-th percentile | 228 |
| Maximum | 247 |
| Range | 125.2 |
| Interquartile range (IQR) | 27.1 |
Descriptive statistics
| Standard deviation | 21.354219 |
|---|---|
| Coefficient of variation (CV) | 0.1176105 |
| Kurtosis | 0.12208167 |
| Mean | 181.56728 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 0.074628384 |
| Sum | 187014.3 |
| Variance | 456.00265 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 192 | 118 | 11.5% |
| 228 | 54 | 5.2% |
| 185.7 | 46 | 4.5% |
| 203.5 | 36 | 3.5% |
| 186 | 28 | 2.7% |
| 164.9 | 20 | 1.9% |
| 162 | 20 | 1.9% |
| 185 | 15 | 1.5% |
| 153.5 | 15 | 1.5% |
| 200 | 14 | 1.4% |
| Other values (185) | 664 |
| Value | Count | Frequency (%) |
| 121.8 | 5 | |
| 126.6 | 5 | |
| 127 | 1 | 0.1% |
| 127.3 | 1 | 0.1% |
| 137.8 | 5 | |
| 140 | 1 | 0.1% |
| 140.8 | 5 | |
| 141.8 | 5 | |
| 142 | 1 | 0.1% |
| 143.3 | 5 |
| Value | Count | Frequency (%) |
| 247 | 1 | 0.1% |
| 246.9 | 1 | 0.1% |
| 237 | 1 | 0.1% |
| 236.7 | 1 | 0.1% |
| 228 | 54 | |
| 221.4 | 1 | 0.1% |
| 221 | 2 | 0.2% |
| 220.1 | 1 | 0.1% |
| 220 | 2 | 0.2% |
| 219.7 | 1 | 0.1% |
superplasticizer
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 111 |
|---|---|
| Distinct (%) | 10.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.2046602 |
| Minimum | 0 |
|---|---|
| Maximum | 32.2 |
| Zeros | 379 |
| Zeros (%) | 36.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 6.4 |
| Q3 | 10.2 |
| 95-th percentile | 16.055 |
| Maximum | 32.2 |
| Range | 32.2 |
| Interquartile range (IQR) | 10.2 |
Descriptive statistics
| Standard deviation | 5.9738414 |
|---|---|
| Coefficient of variation (CV) | 0.96279912 |
| Kurtosis | 1.411269 |
| Mean | 6.2046602 |
| Median Absolute Deviation (MAD) | 5.3 |
| Skewness | 0.90720257 |
| Sum | 6390.8 |
| Variance | 35.686781 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 379 | |
| 11.6 | 37 | 3.6% |
| 8 | 27 | 2.6% |
| 7 | 19 | 1.8% |
| 6 | 17 | 1.7% |
| 9.9 | 16 | 1.6% |
| 8.9 | 16 | 1.6% |
| 7.8 | 16 | 1.6% |
| 9 | 16 | 1.6% |
| 10 | 15 | 1.5% |
| Other values (101) | 472 |
| Value | Count | Frequency (%) |
| 0 | 379 | |
| 1.7 | 4 | 0.4% |
| 1.9 | 1 | 0.1% |
| 2 | 1 | 0.1% |
| 2.2 | 1 | 0.1% |
| 2.5 | 2 | 0.2% |
| 3 | 6 | 0.6% |
| 3.1 | 1 | 0.1% |
| 3.4 | 3 | 0.3% |
| 3.6 | 5 | 0.5% |
| Value | Count | Frequency (%) |
| 32.2 | 5 | |
| 28.2 | 5 | |
| 23.4 | 5 | |
| 22.1 | 1 | 0.1% |
| 22 | 6 | |
| 20.8 | 1 | 0.1% |
| 20 | 1 | 0.1% |
| 19 | 1 | 0.1% |
| 18.8 | 1 | 0.1% |
| 18.6 | 5 |
coarse_aggregate
Real number (ℝ)
| Distinct | 284 |
|---|---|
| Distinct (%) | 27.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 972.91893 |
| Minimum | 801 |
|---|---|
| Maximum | 1145 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.2 KiB |
Quantile statistics
| Minimum | 801 |
|---|---|
| 5-th percentile | 842 |
| Q1 | 932 |
| median | 968 |
| Q3 | 1029.4 |
| 95-th percentile | 1104 |
| Maximum | 1145 |
| Range | 344 |
| Interquartile range (IQR) | 97.4 |
Descriptive statistics
| Standard deviation | 77.753954 |
|---|---|
| Coefficient of variation (CV) | 0.079918225 |
| Kurtosis | -0.5990161 |
| Mean | 972.91893 |
| Median Absolute Deviation (MAD) | 46.3 |
| Skewness | -0.040219745 |
| Sum | 1002106.5 |
| Variance | 6045.6774 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 932 | 57 | 5.5% |
| 852.1 | 45 | 4.4% |
| 944.7 | 30 | 2.9% |
| 968 | 29 | 2.8% |
| 1125 | 24 | 2.3% |
| 1047 | 19 | 1.8% |
| 967 | 19 | 1.8% |
| 974 | 12 | 1.2% |
| 942 | 12 | 1.2% |
| 938 | 12 | 1.2% |
| Other values (274) | 771 |
| Value | Count | Frequency (%) |
| 801 | 4 | |
| 801.1 | 1 | 0.1% |
| 801.4 | 1 | 0.1% |
| 811 | 2 | |
| 814 | 1 | 0.1% |
| 814.1 | 1 | 0.1% |
| 817.9 | 1 | 0.1% |
| 818 | 1 | 0.1% |
| 819 | 2 | |
| 819.2 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 1145 | 1 | 0.1% |
| 1134.3 | 5 | 0.5% |
| 1130 | 1 | 0.1% |
| 1125 | 24 | |
| 1124.4 | 2 | 0.2% |
| 1120 | 2 | 0.2% |
| 1119 | 2 | 0.2% |
| 1118.8 | 2 | 0.2% |
| 1118 | 1 | 0.1% |
| 1113 | 2 | 0.2% |
fine_aggregate
Real number (ℝ)
| Distinct | 302 |
|---|---|
| Distinct (%) | 29.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 773.58049 |
| Minimum | 594 |
|---|---|
| Maximum | 992.6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.2 KiB |
Quantile statistics
| Minimum | 594 |
|---|---|
| 5-th percentile | 613 |
| Q1 | 730.95 |
| median | 779.5 |
| Q3 | 824 |
| 95-th percentile | 898.09 |
| Maximum | 992.6 |
| Range | 398.6 |
| Interquartile range (IQR) | 93.05 |
Descriptive statistics
| Standard deviation | 80.17598 |
|---|---|
| Coefficient of variation (CV) | 0.10364271 |
| Kurtosis | -0.10217699 |
| Mean | 773.58049 |
| Median Absolute Deviation (MAD) | 45.5 |
| Skewness | -0.2530096 |
| Sum | 796787.9 |
| Variance | 6428.1878 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 755.8 | 30 | 2.9% |
| 594 | 30 | 2.9% |
| 670 | 23 | 2.2% |
| 613 | 22 | 2.1% |
| 801 | 16 | 1.6% |
| 746.6 | 15 | 1.5% |
| 887.1 | 15 | 1.5% |
| 712 | 14 | 1.4% |
| 845 | 14 | 1.4% |
| 750 | 12 | 1.2% |
| Other values (292) | 839 |
| Value | Count | Frequency (%) |
| 594 | 30 | |
| 605 | 5 | 0.5% |
| 611.8 | 5 | 0.5% |
| 612 | 1 | 0.1% |
| 613 | 22 | |
| 613.2 | 2 | 0.2% |
| 614 | 1 | 0.1% |
| 623 | 2 | 0.2% |
| 630 | 5 | 0.5% |
| 631 | 4 | 0.4% |
| Value | Count | Frequency (%) |
| 992.6 | 5 | |
| 945 | 4 | |
| 943.1 | 4 | |
| 942 | 4 | |
| 925.7 | 5 | |
| 905.9 | 5 | |
| 903.8 | 5 | |
| 903.6 | 5 | |
| 901.8 | 5 | |
| 900.9 | 5 |
age
Real number (ℝ)
| Distinct | 14 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 45.662136 |
| Minimum | 1 |
|---|---|
| Maximum | 365 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 7 |
| median | 28 |
| Q3 | 56 |
| 95-th percentile | 180 |
| Maximum | 365 |
| Range | 364 |
| Interquartile range (IQR) | 49 |
Descriptive statistics
| Standard deviation | 63.169912 |
|---|---|
| Coefficient of variation (CV) | 1.38342 |
| Kurtosis | 12.168989 |
| Mean | 45.662136 |
| Median Absolute Deviation (MAD) | 21 |
| Skewness | 3.2691774 |
| Sum | 47032 |
| Variance | 3990.4377 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=14)
| Value | Count | Frequency (%) |
| 28 | 425 | |
| 3 | 134 | 13.0% |
| 7 | 126 | 12.2% |
| 56 | 91 | 8.8% |
| 14 | 62 | 6.0% |
| 90 | 54 | 5.2% |
| 100 | 52 | 5.0% |
| 180 | 26 | 2.5% |
| 91 | 22 | 2.1% |
| 365 | 14 | 1.4% |
| Other values (4) | 24 | 2.3% |
| Value | Count | Frequency (%) |
| 1 | 2 | 0.2% |
| 3 | 134 | 13.0% |
| 7 | 126 | 12.2% |
| 14 | 62 | 6.0% |
| 28 | 425 | |
| 56 | 91 | 8.8% |
| 90 | 54 | 5.2% |
| 91 | 22 | 2.1% |
| 100 | 52 | 5.0% |
| 120 | 3 | 0.3% |
| Value | Count | Frequency (%) |
| 365 | 14 | 1.4% |
| 360 | 6 | 0.6% |
| 270 | 13 | 1.3% |
| 180 | 26 | 2.5% |
| 120 | 3 | 0.3% |
| 100 | 52 | 5.0% |
| 91 | 22 | 2.1% |
| 90 | 54 | 5.2% |
| 56 | 91 | 8.8% |
| 28 | 425 |
concrete_compressive_strength
Real number (ℝ)
| Distinct | 845 |
|---|---|
| Distinct (%) | 82.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.817961 |
| Minimum | 2.33 |
|---|---|
| Maximum | 82.6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.2 KiB |
Quantile statistics
| Minimum | 2.33 |
|---|---|
| 5-th percentile | 10.961 |
| Q1 | 23.71 |
| median | 34.445 |
| Q3 | 46.135 |
| 95-th percentile | 66.802 |
| Maximum | 82.6 |
| Range | 80.27 |
| Interquartile range (IQR) | 22.425 |
Descriptive statistics
| Standard deviation | 16.705742 |
|---|---|
| Coefficient of variation (CV) | 0.46640684 |
| Kurtosis | -0.31372486 |
| Mean | 35.817961 |
| Median Absolute Deviation (MAD) | 10.93 |
| Skewness | 0.41697729 |
| Sum | 36892.5 |
| Variance | 279.08181 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 33.4 | 6 | 0.6% |
| 77.3 | 4 | 0.4% |
| 79.3 | 4 | 0.4% |
| 31.35 | 4 | 0.4% |
| 71.3 | 4 | 0.4% |
| 35.3 | 4 | 0.4% |
| 23.52 | 4 | 0.4% |
| 41.05 | 4 | 0.4% |
| 44.28 | 3 | 0.3% |
| 41.54 | 3 | 0.3% |
| Other values (835) | 990 |
| Value | Count | Frequency (%) |
| 2.33 | 1 | |
| 3.32 | 1 | |
| 4.57 | 1 | |
| 4.78 | 1 | |
| 4.83 | 1 | |
| 4.9 | 1 | |
| 6.27 | 1 | |
| 6.28 | 1 | |
| 6.47 | 1 | |
| 6.81 | 1 |
| Value | Count | Frequency (%) |
| 82.6 | 1 | 0.1% |
| 81.75 | 1 | 0.1% |
| 80.2 | 1 | 0.1% |
| 79.99 | 1 | 0.1% |
| 79.4 | 1 | 0.1% |
| 79.3 | 4 | |
| 78.8 | 1 | 0.1% |
| 77.3 | 4 | |
| 76.8 | 1 | 0.1% |
| 76.24 | 1 | 0.1% |
| cement | blast_furnace_slag | fly_ash | water | superplasticizer | coarse_aggregate | fine_aggregate | age | concrete_compressive_strength | |
|---|---|---|---|---|---|---|---|---|---|
| cement | 1.000 | -0.245 | -0.418 | -0.094 | 0.038 | -0.145 | -0.174 | 0.005 | 0.478 |
| blast_furnace_slag | -0.245 | 1.000 | -0.254 | 0.053 | 0.098 | -0.349 | -0.302 | -0.018 | 0.164 |
| fly_ash | -0.418 | -0.254 | 1.000 | -0.283 | 0.454 | 0.058 | 0.051 | 0.003 | -0.078 |
| water | -0.094 | 0.053 | -0.283 | 1.000 | -0.687 | -0.218 | -0.346 | 0.091 | -0.308 |
| superplasticizer | 0.038 | 0.098 | 0.454 | -0.687 | 1.000 | -0.199 | 0.168 | -0.010 | 0.348 |
| coarse_aggregate | -0.145 | -0.349 | 0.058 | -0.218 | -0.199 | 1.000 | -0.100 | -0.045 | -0.184 |
| fine_aggregate | -0.174 | -0.302 | 0.051 | -0.346 | 0.168 | -0.100 | 1.000 | -0.057 | -0.180 |
| age | 0.005 | -0.018 | 0.003 | 0.091 | -0.010 | -0.045 | -0.057 | 1.000 | 0.596 |
| concrete_compressive_strength | 0.478 | 0.164 | -0.078 | -0.308 | 0.348 | -0.184 | -0.180 | 0.596 | 1.000 |
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
| cement | blast_furnace_slag | fly_ash | water | superplasticizer | coarse_aggregate | fine_aggregate | age | concrete_compressive_strength | |
|---|---|---|---|---|---|---|---|---|---|
| 0 | 540.0 | 0.0 | 0.0 | 162.0 | 2.5 | 1040.0 | 676.0 | 28 | 79.99 |
| 1 | 540.0 | 0.0 | 0.0 | 162.0 | 2.5 | 1055.0 | 676.0 | 28 | 61.89 |
| 2 | 332.5 | 142.5 | 0.0 | 228.0 | 0.0 | 932.0 | 594.0 | 270 | 40.27 |
| 3 | 332.5 | 142.5 | 0.0 | 228.0 | 0.0 | 932.0 | 594.0 | 365 | 41.05 |
| 4 | 198.6 | 132.4 | 0.0 | 192.0 | 0.0 | 978.4 | 825.5 | 360 | 44.30 |
| 5 | 266.0 | 114.0 | 0.0 | 228.0 | 0.0 | 932.0 | 670.0 | 90 | 47.03 |
| 6 | 380.0 | 95.0 | 0.0 | 228.0 | 0.0 | 932.0 | 594.0 | 365 | 43.70 |
| 7 | 380.0 | 95.0 | 0.0 | 228.0 | 0.0 | 932.0 | 594.0 | 28 | 36.45 |
| 8 | 266.0 | 114.0 | 0.0 | 228.0 | 0.0 | 932.0 | 670.0 | 28 | 45.85 |
| 9 | 475.0 | 0.0 | 0.0 | 228.0 | 0.0 | 932.0 | 594.0 | 28 | 39.29 |
| cement | blast_furnace_slag | fly_ash | water | superplasticizer | coarse_aggregate | fine_aggregate | age | concrete_compressive_strength | |
|---|---|---|---|---|---|---|---|---|---|
| 1020 | 288.4 | 121.0 | 0.0 | 177.4 | 7.0 | 907.9 | 829.5 | 28 | 42.14 |
| 1021 | 298.2 | 0.0 | 107.0 | 209.7 | 11.1 | 879.6 | 744.2 | 28 | 31.88 |
| 1022 | 264.5 | 111.0 | 86.5 | 195.5 | 5.9 | 832.6 | 790.4 | 28 | 41.54 |
| 1023 | 159.8 | 250.0 | 0.0 | 168.4 | 12.2 | 1049.3 | 688.2 | 28 | 39.46 |
| 1024 | 166.0 | 259.7 | 0.0 | 183.2 | 12.7 | 858.8 | 826.8 | 28 | 37.92 |
| 1025 | 276.4 | 116.0 | 90.3 | 179.6 | 8.9 | 870.1 | 768.3 | 28 | 44.28 |
| 1026 | 322.2 | 0.0 | 115.6 | 196.0 | 10.4 | 817.9 | 813.4 | 28 | 31.18 |
| 1027 | 148.5 | 139.4 | 108.6 | 192.7 | 6.1 | 892.4 | 780.0 | 28 | 23.70 |
| 1028 | 159.1 | 186.7 | 0.0 | 175.6 | 11.3 | 989.6 | 788.9 | 28 | 32.77 |
| 1029 | 260.9 | 100.5 | 78.3 | 200.6 | 8.6 | 864.5 | 761.5 | 28 | 32.40 |
Most frequently occurring
| cement | blast_furnace_slag | fly_ash | water | superplasticizer | coarse_aggregate | fine_aggregate | age | concrete_compressive_strength | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 362.6 | 189.0 | 0.0 | 164.9 | 11.6 | 944.7 | 755.8 | 3 | 35.30 | 4 |
| 3 | 362.6 | 189.0 | 0.0 | 164.9 | 11.6 | 944.7 | 755.8 | 28 | 71.30 | 4 |
| 4 | 362.6 | 189.0 | 0.0 | 164.9 | 11.6 | 944.7 | 755.8 | 56 | 77.30 | 4 |
| 5 | 362.6 | 189.0 | 0.0 | 164.9 | 11.6 | 944.7 | 755.8 | 91 | 79.30 | 4 |
| 2 | 362.6 | 189.0 | 0.0 | 164.9 | 11.6 | 944.7 | 755.8 | 7 | 55.90 | 3 |
| 6 | 425.0 | 106.3 | 0.0 | 153.5 | 16.5 | 852.1 | 887.1 | 3 | 33.40 | 3 |
| 7 | 425.0 | 106.3 | 0.0 | 153.5 | 16.5 | 852.1 | 887.1 | 7 | 49.20 | 3 |
| 8 | 425.0 | 106.3 | 0.0 | 153.5 | 16.5 | 852.1 | 887.1 | 28 | 60.29 | 3 |
| 9 | 425.0 | 106.3 | 0.0 | 153.5 | 16.5 | 852.1 | 887.1 | 56 | 64.30 | 3 |
| 10 | 425.0 | 106.3 | 0.0 | 153.5 | 16.5 | 852.1 | 887.1 | 91 | 65.20 | 3 |